SUMMARY

The set of all bivariate distributions with support contained in
$\{(i,j);\ i = 1,2 \text{ and } j = 1,2,\ldots,n\}$ which are positive quadrant
dependent and have fixed marginals is a convex set. In this paper, an algebraic
method is presented for the enumeration of all extreme points of this convex
set. Certain measures of dependence, including Kendall's tau, are shown to be
affine functions on this convex set. This property of being affine helps us to
evaluate the asymptotic power of tests based on these measures of dependence
for testing the hypothesis of independence against strict positive quadrant
dependence.


Keywords: POSITIVE QUADRANT DEPENDENCE; CONVEX SET; EXTREME POINTS; MEASURES
OF DEPENDENCE; KENDALL'S TAU; SOMERS' d; PEARSON'S CORRELATION
COEFFICIENT; SPEARMAN'S RHO; HYPOTHESIS OF INDEPENDENCE; POWER OF
A TEST; ASYMPTOTICS


1. INTRODUCTION AND PRELIMINARIES

A good understanding of the nature of dependence among multivariate
probability distributions is useful for modelling multivariate random phenomena.
There is a multitude of dependence notions available in the literature. For a
good account of these notions, one may refer to Lehmann (1966), Barlow and
Proschan (1981), and Eaton (1982). In this paper, we concentrate on one particular
notion of dependence, namely, positive quadrant dependence among discrete
bivariate distributions, and examine the structure of such distributions by
performing an extreme point analysis. We now describe the problem. Let X and Y
be two random variables with some joint probability distribution function F.
Assume that X takes only the two values 1 and 2, and Y takes the n values
$1, 2, \ldots, n$. The joint probability distribution of X and Y can be described
by a matrix $P = (p_{ij})$ of order $2 \times n$, where $p_{ij} = \Pr(X = i, Y = j)$,
$i = 1,2$ and $j = 1,2,\ldots,n$. The random variables X and Y are said to be
positive quadrant dependent (PQD) (equivalently, F or P is said to be positive
quadrant dependent) if

\[ \Pr(X \le i,\ Y \le j) \ge \Pr(X \le i)\,\Pr(Y \le j) \]

for all i and j. Let $p_i = \Pr(X = i)$, $i = 1,2$, and $q_j = \Pr(Y = j)$,
$j = 1,2,\ldots,n$. The above condition can be phrased, equivalently, as follows.


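\[
\begin{aligned}
p_{11} &\ge p_1 q_1, \\
p_{11} + p_{12} &\ge p_1 (q_1 + q_2), \\
p_{11} + p_{12} + p_{13} &\ge p_1 (q_1 + q_2 + q_3), \\
&\;\;\vdots \\
p_{11} + p_{12} + \cdots + p_{1,n-1} &\ge p_1 (q_1 + q_2 + \cdots + q_{n-1}).
\end{aligned}
\tag{1.1}
\]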

Let $M_{PQD}$ denote the collection of all bivariate positive quadrant dependent
distributions with support contained in $\{(i,j);\ i = 1,2 \text{ and } j = 1,2,\ldots,n\}$.
This set is not a convex set: it is easy to find two bivariate distributions
$P_1$ and $P_2$ in $M_{PQD}$ and a number $0 < \lambda < 1$ such that
$\lambda P_1 + (1-\lambda) P_2 \notin M_{PQD}$. However, if we fix the marginal
distributions of X and Y, the above set becomes a convex set. More precisely,
let $p_1, p_2$ and $q_1, q_2, \ldots, q_n$ be two sets of non-negative numbers
satisfying $p_1 + p_2 = 1 = q_1 + q_2 + \cdots + q_n$. Let

\[
M_{PQD}(p_1,p_2;q_1,q_2,\ldots,q_n) = \{P = (p_{ij}) \in M_{PQD};\ p_{1j} + p_{2j} = q_j
\text{ for } j = 1,2,\ldots,n \text{ and } p_{i1} + p_{i2} + \cdots + p_{in} = p_i
\text{ for } i = 1,2\}.
\]

Under the bivariate distribution P in $M_{PQD}(p_1,p_2;q_1,q_2,\ldots,q_n)$, X and Y
are positive quadrant dependent, X has the marginal distribution $p_1, p_2$,
and Y has the marginal distribution $q_1, q_2, \ldots, q_n$. This set has nice
properties.
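The membership condition (1.1) and the failure of convexity when the marginals
are allowed to vary can be checked numerically. The following is a minimal
sketch; the names `is_pqd` and `mix`, the tolerance, and the two example
matrices are illustrative assumptions and do not come from the paper.

```python
from itertools import accumulate

def is_pqd(P, tol=1e-12):
    """Check the inequalities (1.1) for a 2 x n matrix P = [row1, row2]."""
    row1, row2 = P
    p1 = sum(row1)                                  # Pr(X = 1)
    q = [a + b for a, b in zip(row1, row2)]         # column marginals q_j
    partial = list(accumulate(row1))[:-1]           # p_11 + ... + p_1j, j < n
    Q = list(accumulate(q))[:-1]                    # q_1 + ... + q_j, j < n
    return all(s >= p1 * c - tol for s, c in zip(partial, Q))

def mix(lam, A, B):
    """Entrywise convex combination lam*A + (1 - lam)*B."""
    return [[lam * a + (1 - lam) * b for a, b in zip(ra, rb)]
            for ra, rb in zip(A, B)]

# Two independent (hence PQD) distributions with different marginals.
A = [[0.09, 0.81], [0.01, 0.09]]   # p1 = 0.9, q1 = 0.1
B = [[0.09, 0.01], [0.81, 0.09]]   # p1 = 0.1, q1 = 0.9
C = mix(0.5, A, B)                 # marginals become p1 = q1 = 0.5

print(is_pqd(A), is_pqd(B), is_pqd(C))
```

In this example the mixture has $p_{11} = 0.09$ while $p_1 q_1 = 0.25$, so the
first inequality of (1.1) fails even though both components satisfy it.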

Theorem 1. The set $M_{PQD}(p_1,p_2;q_1,q_2,\ldots,q_n)$ is compact and convex. More
strongly, this set is a simplex.

Proof. The convexity follows from the inequalities (1.1). That the set is a
simplex is obvious.

Even though the set $M_{PQD}$ is not convex, it can be written as a
union of compact convex sets, i.e.,

\[ M_{PQD} = \bigcup M_{PQD}(p_1,p_2;q_1,q_2,\ldots,q_n), \]

where the union is taken over all non-negative numbers $p_1, p_2$;
$q_1, q_2, \ldots, q_n$ with $p_1 + p_2 = 1 = q_1 + q_2 + \cdots + q_n$. The fact
that $M_{PQD}(p_1,p_2;q_1,q_2,\ldots,q_n)$ is a simplex implies that it has only a
finite number of extreme points, and that every member of this set can be
written as a convex combination of its extreme points. Looking at the notion of
positive quadrant dependence from a global point of view as enunciated above is
useful. Some of the applications given in Section 3 amplify this point. A good
understanding of the nature of extreme points will provide a deep insight into
the mechanism of positive quadrant dependence of two random variables.

One of the goals of this paper is to present an algebraic method
to find all the extreme points of the convex set $M_{PQD}(p_1,p_2;q_1,q_2,\ldots,q_n)$.
The cases n = 2 and 3 were discussed by Nguyen and Sampson (1985) in the
context of examining the position of the set $M_{PQD}(p_1,p_2;q_1,q_2,\ldots,q_n)$ in
the class of all bivariate distributions. These cases were also discussed by
Bhaskara Rao, Krishnaiah and Subramanyam (1987) in the spirit with which this
paper is concerned. Some measures of dependence between pairs of random
variables, especially Kendall's tau, will be discussed in the light of the
convexity property of the set $M_{PQD}(p_1,p_2;q_1,q_2,\ldots,q_n)$. The structure of
this set will be exploited to examine properties of certain tests of
independence. The affine property of certain measures of dependence makes it
easy to evaluate the asymptotic powers of tests based on these measures of
dependence.

2. ON EXTREME POINTS

Without loss of generality, assume that each of the numbers $p_1, p_2$,
$q_1, q_2, \ldots, q_n$ is positive. The following result characterizes members of
$M_{PQD}(p_1,p_2;q_1,q_2,\ldots,q_n)$. This result is useful to develop an algebraic
method for the extraction of all the extreme points of
$M_{PQD}(p_1,p_2;q_1,q_2,\ldots,q_n)$. Some notation is in order. For any two real
numbers a and b, $a \vee b$ denotes the maximum of a and b, and $a \wedge b$
denotes the minimum of a and b.

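Theorem 2. Let $P = (p_{ij}) \in M_{PQD}(p_1,p_2;q_1,q_2,\ldots,q_n)$. Then the numbers
$p_{11}, p_{12}, \ldots, p_{1,n-1}$ satisfy the following inequalities.

\[
\begin{aligned}
p_1 q_1 &\le p_{11} \le p_1 \wedge q_1, \\
p_{11} \vee p_1(q_1 + q_2) &\le p_{11} + p_{12} \le p_1 \wedge (p_{11} + q_2), \\
(p_{11} + p_{12}) \vee p_1(q_1 + q_2 + q_3) &\le p_{11} + p_{12} + p_{13}
   \le p_1 \wedge (p_{11} + p_{12} + q_3), \\
&\;\;\vdots \\
(p_{11} + \cdots + p_{1,n-2}) \vee p_1(q_1 + \cdots + q_{n-1}) &\le p_{11} + \cdots + p_{1,n-1}
   \le p_1 \wedge (p_{11} + \cdots + p_{1,n-2} + q_{n-1}).
\end{aligned}
\tag{2.1}
\]

Conversely, suppose $p_{11}, p_{12}, \ldots, p_{1,n-1}$ are (n-1) numbers satisfying
the above inequalities. Let $p_{1n} = p_1 - (p_{11} + p_{12} + \cdots + p_{1,n-1})$ and
$p_{2j} = q_j - p_{1j}$, $j = 1,2,\ldots,n$. Then the matrix
$P = (p_{ij}) \in M_{PQD}(p_1,p_2;q_1,q_2,\ldots,q_n)$.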

Proof. Some comments are in order on the above result. Since the row and
column sums of matrices $P = (p_{ij})$ in $M_{PQD}(p_1,p_2;q_1,q_2,\ldots,q_n)$ are fixed,
knowledge of the (n-1) numbers $p_{11}, p_{12}, \ldots, p_{1,n-1}$ is sufficient. The
remaining entries of P can be determined by appropriate subtractions.

Coming to the proof, we observe that since P is positive quadrant dependent,
$p_{11} + p_{12} + \cdots + p_{1i} \ge p_1(q_1 + q_2 + \cdots + q_i)$ for
$i = 1,2,\ldots,n-1$. Since the entries of P are non-negative,
$p_{11} + p_{12} + \cdots + p_{1i} \ge p_{11} + p_{12} + \cdots + p_{1,i-1}$ for
$i = 2,3,\ldots,n-1$. These two sets of inequalities establish the validity of
the inequalities on the left hand side of (2.1). In view of the marginality
restrictions, it follows that $p_{11} + p_{12} + \cdots + p_{1i} \le p_1$ and also
$p_{11} + p_{12} + \cdots + p_{1i} \le p_{11} + p_{12} + \cdots + p_{1,i-1} + q_i$ for
$i = 1,2,\ldots,n-1$. Thus we see the validity of the inequalities on the right
hand side of (2.1). Conversely, if $p_{11}, p_{12}, \ldots, p_{1,n-1}$ satisfy the
inequalities (2.1), it immediately follows that $p_{11} \ge 0$ and $p_{1j} \ge 0$
for $j = 2,3,\ldots,n-1$. From the inequalities on the right hand side of (2.1),
it follows that each of $p_{2j}$, $j = 1,2,\ldots,n-1$, is non-negative. Observe
also that $p_{1n}$ is non-negative. It remains to be shown that $p_{2n}$ is
non-negative. Since

\[ p_{2n} = q_n - p_{1n} = q_n - (p_1 - (p_{11} + p_{12} + \cdots + p_{1,n-1}))
          = q_n - p_1 + (p_{11} + p_{12} + \cdots + p_{1,n-1}), \]

it suffices to show that $p_{11} + p_{12} + \cdots + p_{1,n-1} \ge p_1 - q_n$. From
the last inequality of (2.1), it follows that

\[ p_{11} + p_{12} + \cdots + p_{1,n-1} \ge p_1(q_1 + q_2 + \cdots + q_{n-1})
   = p_1(1 - q_n) = p_1 - p_1 q_n \ge p_1 - q_n. \]

It is now clear that the matrix $P = (p_{ij})$ has the required marginal sums
and that P is positive quadrant dependent.

This completes the proof.
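Theorem 2 reduces membership in the set to a chain of interval conditions on
the partial sums of the first row. A minimal sketch of that check follows, with
the function names `satisfies_2_1` and `complete_matrix`, the tolerance, and the
numerical inputs chosen here purely for illustration.

```python
def satisfies_2_1(x, p1, q, tol=1e-12):
    """x = (p_11, ..., p_1,n-1); checks the chain of inequalities (2.1)."""
    s_prev, Q = 0.0, 0.0
    for xi, qi in zip(x, q):              # zip stops after the n-1 entries of x
        s, Q = s_prev + xi, Q + qi
        lower = max(s_prev, p1 * Q)       # (previous partial sum) v p1(q_1+...+q_i)
        upper = min(p1, s_prev + qi)      # p1 ^ (previous partial sum + q_i)
        if not (lower - tol <= s <= upper + tol):
            return False
        s_prev = s
    return True

def complete_matrix(x, p1, q):
    """Fill in p_1n and the second row from the fixed marginals."""
    row1 = list(x) + [p1 - sum(x)]
    row2 = [qj - a for qj, a in zip(q, row1)]
    return [row1, row2]

q = (0.5, 0.3, 0.2)
print(satisfies_2_1((0.24, 0.0), 0.3, q))   # a point allowed by (2.1)
print(satisfies_2_1((0.10, 0.10), 0.3, q))  # p_11 falls below p1*q1 = 0.15
print(complete_matrix((0.24, 0.0), 0.3, q))
```

The first call should report that (2.1) holds and the second that it fails; the
completed matrix then has non-negative entries and the prescribed marginals, as
the converse part of the theorem asserts.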
An algebraic method for the identification of the extreme points of
$M_{PQD}(p_1,p_2;q_1,q_2,\ldots,q_n)$

The following proposition, in conjunction with Theorem 2 above,
forms the basis of the algebraic method we intend to develop.

Proposition 3. Let $a_1, a_2, \ldots, a_m$ and $b_1, b_2, \ldots, b_m$ be 2m fixed
numbers. Let M be the set of all vectors $(x_1, x_2, \ldots, x_m)$ satisfying

\[
\begin{aligned}
a_1 &\le x_1 \le b_1, \\
a_2 &\le x_1 + x_2 \le b_2, \\
a_3 &\le x_1 + x_2 + x_3 \le b_3, \\
&\;\;\vdots \\
a_m &\le x_1 + x_2 + \cdots + x_m \le b_m.
\end{aligned}
\tag{2.2}
\]

Then M is a compact convex set and every extreme point
$(x_1^*, x_2^*, \ldots, x_m^*)$ of M satisfies
$x_1^* + x_2^* + \cdots + x_i^* = a_i$ or $b_i$ for each $i = 1,2,\ldots,m$.

Proof. It is obvious that M is a compact convex set. Assume that M is
non-empty. Let $(x_1^*, x_2^*, \ldots, x_m^*) \in M$. We will show that if
$x_1^* + x_2^* + \cdots + x_i^*$ is equal neither to $a_i$ nor to $b_i$ for some i,
then $(x_1^*, x_2^*, \ldots, x_m^*)$ is not an extreme point of M. Let i be the
largest index in $\{1,2,\ldots,m\}$ such that $x_1^* + x_2^* + \cdots + x_i^*$ is
equal neither to $a_i$ nor to $b_i$. Let $x_1' = x_1'' = x_1^*$;
$x_2' = x_2'' = x_2^*$; ...; $x_{i-1}' = x_{i-1}'' = x_{i-1}^*$. Solve the following
equations

\[ x_1' + x_2' + \cdots + x_i' = a_i \]

and

\[ x_1'' + x_2'' + \cdots + x_i'' = b_i \]

for $x_i'$ and $x_i''$, respectively. Then we can write
$x_i^* = \lambda x_i' + (1-\lambda) x_i''$ for some $0 < \lambda < 1$. Now solve
the following equations

\[ x_1' + x_2' + \cdots + x_j' = a_j \quad\text{and}\quad
   x_1'' + x_2'' + \cdots + x_j'' = a_j
   \quad\text{if } x_1^* + x_2^* + \cdots + x_j^* = a_j, \]

or the equations

\[ x_1' + x_2' + \cdots + x_j' = b_j \quad\text{and}\quad
   x_1'' + x_2'' + \cdots + x_j'' = b_j
   \quad\text{if } x_1^* + x_2^* + \cdots + x_j^* = b_j, \]

for $j = i+1, i+2, \ldots, m$, successively. It is now clear that
$(x_1', x_2', \ldots, x_m')$ and $(x_1'', x_2'', \ldots, x_m'')$ are distinct, belong
to M, and

\[ (x_1^*, x_2^*, \ldots, x_m^*) = \lambda (x_1', x_2', \ldots, x_m')
   + (1-\lambda)(x_1'', x_2'', \ldots, x_m''). \]

Hence $(x_1^*, x_2^*, \ldots, x_m^*)$ cannot be an extreme point of M.

The extreme points of M are easy to find. Set the central
expression in each of the inequalities (2.2) equal either to the quantity
on the right or to the quantity on the left, and then solve the resultant
system of equations in $x_1, x_2, \ldots, x_m$. These systems of equations are
easy to solve recursively. A crude upper bound to the total number of
extreme points of M is $2^m$. The system of inequalities (2.1) bears a
close resemblance to the system of inequalities (2.2). The presence of the
symbols $\vee$ and $\wedge$ in the system (2.1) seems to complicate the problem
of finding the extreme points of $M_{PQD}(p_1,p_2;q_1,q_2,\ldots,q_n)$. In order to
reduce the system of inequalities (2.1) to one of the form (2.2),
we must find a way of getting rid of the symbols $\vee$ and $\wedge$ from (2.1).
This can be achieved by splitting each inequality in the system (2.1)
into sub-inequalities, if necessary, using the subsequent inequalities
to get rid of the symbols $\vee$ and $\wedge$. Eventually, we should be able to
get systems of inequalities of the type (2.2) equivalent to (2.1). We
will explain this method by working out some examples.
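The recipe for a system of the form (2.2) (set each partial sum to its lower or
upper bound and solve recursively) can be sketched as follows. This is an
illustration under the assumption $a_i \le b_i$ for every i; the function name
`extreme_points_2_2` and the sample bounds are hypothetical choices, and exact
rational arithmetic is used only to avoid rounding noise.

```python
from fractions import Fraction as F
from itertools import product

def extreme_points_2_2(a, b):
    """Candidate corners of M: each partial sum x_1+...+x_i equals a_i or b_i."""
    points = set()
    for sums in product(*[sorted({ai, bi}) for ai, bi in zip(a, b)]):
        # Recover x_i as the difference of consecutive partial sums.
        x = tuple(s - t for s, t in zip(sums, (0,) + sums[:-1]))
        points.add(x)
    return sorted(points)

# Sample bounds: 0.15 <= x1 <= 0.24 and 0.24 <= x1 + x2 <= 0.30.
a = [F(15, 100), F(24, 100)]
b = [F(24, 100), F(30, 100)]
for x in extreme_points_2_2(a, b):
    print([float(v) for v in x])
```

At most $2^m$ points are produced, in line with the crude bound stated above.
When several such systems are obtained by splitting (2.1), the points found for
the individual systems still have to be pooled and screened, since a corner of
one piece need not be an extreme point of the whole set.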

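Example 1  The case when n = 2.

The determining inequalities (2.1) become

\[ p_1 q_1 \le p_{11} \le p_1 \wedge q_1. \]

There are only two extreme points of $M_{PQD}(p_1,p_2;q_1,q_2)$. If
$p_1 \le q_1$, these are given by

\[
P_1 = \begin{pmatrix} p_1 q_1 & p_1 q_2 \\ p_2 q_1 & p_2 q_2 \end{pmatrix}
\quad\text{and}\quad
P_2 = \begin{pmatrix} p_1 & 0 \\ q_1 - p_1 & q_2 \end{pmatrix}.
\]

If $q_1 \le p_1$, the extreme points are

\[
P_1 = \begin{pmatrix} p_1 q_1 & p_1 q_2 \\ p_2 q_1 & p_2 q_2 \end{pmatrix}
\quad\text{and}\quad
P_2 = \begin{pmatrix} q_1 & p_1 - q_1 \\ 0 & p_2 \end{pmatrix}.
\]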

Example 2  The case when n = 3.

The determining inequalities (2.1) become

\[ p_1 q_1 \le p_{11} \le p_1 \wedge q_1 \]

and

\[ p_{11} \vee p_1(q_1 + q_2) \le p_{11} + p_{12} \le p_1 \wedge (p_{11} + q_2). \]

Since there are only two variables, this system of inequalities
can be solved graphically. As an illustration, let $p_1 = 0.3$, $p_2 = 0.7$
and $q_1 = 0.5$, $q_2 = 0.3$, $q_3 = 0.2$. The inequalities (2.1) become

\[ 0.15 \le p_{11} \le 0.3 \]

and

\[ p_{11} \vee 0.24 \le p_{11} + p_{12} \le 0.3. \]

The maximum symbol $\vee$ in the second inequality above can be eliminated
by splitting the first inequality into two parts: $0.15 \le p_{11} \le 0.24$
and $0.24 \le p_{11} \le 0.3$. The above system of inequalities is equivalent
to the following two systems of inequalities:

\[ 0.15 \le p_{11} \le 0.24 \quad\text{and}\quad 0.24 \le p_{11} + p_{12} \le 0.3 \tag{2.3} \]

or

\[ 0.24 \le p_{11} \le 0.3 \quad\text{and}\quad p_{11} \le p_{11} + p_{12} \le 0.3 \tag{2.4} \]

in the sense that if $p_{11}$ and $p_{12}$ satisfy the given system (2.1) of
inequalities, then $p_{11}$ and $p_{12}$ satisfy either the system (2.3) or
the system (2.4). The graph of these two systems of inequalities is given
below.


[Figure: Graph of the inequalities (2.3) and (2.4)]


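The extreme points of the convex set $M_{PQD}(0.3, 0.7; 0.5, 0.3, 0.2)$ can be
read off directly from the above graph. These are given by

\[
P_1 = \begin{pmatrix} 0.15 & 0.09 & 0.06 \\ 0.35 & 0.21 & 0.14 \end{pmatrix}, \quad
P_2 = \begin{pmatrix} 0.15 & 0.15 & 0.00 \\ 0.35 & 0.15 & 0.20 \end{pmatrix},
\]
\[
P_3 = \begin{pmatrix} 0.30 & 0.00 & 0.00 \\ 0.20 & 0.30 & 0.20 \end{pmatrix}, \quad
P_4 = \begin{pmatrix} 0.24 & 0.00 & 0.06 \\ 0.26 & 0.30 & 0.14 \end{pmatrix}.
\]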

One could use the algebraic method described following Proposition 3 to find
the extreme points using the two systems of inequalities (2.3) and (2.4).
After weeding out the duplicates and non-extreme points, one gets the same
four matrices detailed above.
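For n = 3 the graphical solution can also be reproduced by brute force:
intersect every pair of boundary lines of (2.1), written in the coordinates
$s_1 = p_{11}$ and $s_2 = p_{11} + p_{12}$, and keep the intersections that
satisfy all the inequalities. The sketch below does this with Cramer's rule and
exact fractions. It is not the splitting method of the paper, only a cross-check,
and the names `constraints_n3`, `polygon_vertices` and `to_matrix` are
illustrative.

```python
from fractions import Fraction as F
from itertools import combinations

def constraints_n3(p1, q):
    """Half-planes a1*s1 + a2*s2 <= b encoding (2.1) for n = 3."""
    q1, q2, _ = q
    return [
        (-1, 0, -p1 * q1),            # s1 >= p1*q1
        (1, 0, min(p1, q1)),          # s1 <= p1 ^ q1
        (0, -1, -p1 * (q1 + q2)),     # s2 >= p1*(q1 + q2)
        (1, -1, F(0)),                # s2 >= s1
        (0, 1, p1),                   # s2 <= p1
        (-1, 1, q2),                  # s2 <= s1 + q2
    ]

def polygon_vertices(cons):
    verts = set()
    for (a1, a2, b), (c1, c2, d) in combinations(cons, 2):
        det = a1 * c2 - a2 * c1
        if det == 0:
            continue                  # parallel boundary lines
        s1 = (b * c2 - a2 * d) / det  # Cramer's rule
        s2 = (a1 * d - b * c1) / det
        if all(u * s1 + v * s2 <= w for u, v, w in cons):
            verts.add((s1, s2))
    return sorted(verts)

def to_matrix(s1, s2, p1, q):
    row1 = [s1, s2 - s1, p1 - s2]
    return [row1, [qj - a for qj, a in zip(q, row1)]]

p1, q = F(3, 10), (F(1, 2), F(3, 10), F(1, 5))
for s1, s2 in polygon_vertices(constraints_n3(p1, q)):
    print([[float(v) for v in row] for row in to_matrix(s1, s2, p1, q)])
```

For the numerical illustration above this yields four vertices, at
$(s_1, s_2) = (0.15, 0.24)$, $(0.15, 0.30)$, $(0.24, 0.24)$ and $(0.30, 0.30)$,
which correspond to the four matrices listed in the example.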


Example 3  The case when n = 4.

The determining inequalities (2.1) become

\[ p_1 q_1 \le p_{11} \le p_1 \wedge q_1, \]

\[ p_{11} \vee p_1(q_1 + q_2) \le p_{11} + p_{12} \le p_1 \wedge (p_{11} + q_2), \]

and

\[ (p_{11} + p_{12}) \vee p_1(q_1 + q_2 + q_3) \le p_{11} + p_{12} + p_{13}
   \le p_1 \wedge (p_{11} + p_{12} + q_3). \]

For a specific illustration, take $p_1 = 0.6$, $p_2 = 0.4$ and $q_1 = 0.2$,
$q_2 = 0.4$, $q_3 = 0.2$, $q_4 = 0.2$. The inequalities then reduce to

\[
\begin{aligned}
0.12 &\le p_{11} \le 0.2, \\
0.36 &\le p_{11} + p_{12} \le p_{11} + 0.4, \\
(p_{11} + p_{12}) \vee 0.48 &\le p_{11} + p_{12} + p_{13} \le 0.6 \wedge (p_{11} + p_{12} + 0.2).
\end{aligned}
\tag{2.5}
\]

The symbols $\vee$ and $\wedge$ are present only in the last inequality above.
They can be eliminated by splitting the first two inequalities carefully. The
system (2.5) is equivalent to the following three systems of inequalities:

\[
0.12 \le p_{11} \le 0.2, \quad
0.36 \le p_{11} + p_{12} \le 0.4, \quad\text{and}\quad
0.48 \le p_{11} + p_{12} + p_{13} \le p_{11} + p_{12} + 0.2;
\]

or

\[
0.12 \le p_{11} \le 0.2, \quad
0.4 \le p_{11} + p_{12} \le 0.48, \quad\text{and}\quad
0.48 \le p_{11} + p_{12} + p_{13} \le 0.6;
\]

or

\[
0.12 \le p_{11} \le 0.2, \quad
0.48 \le p_{11} + p_{12} \le p_{11} + 0.4, \quad\text{and}\quad
p_{11} + p_{12} \le p_{11} + p_{12} + p_{13} \le 0.6.
\]

For each system above, one can find its extreme points algebraically by
setting the central expression in each inequality to either the quantity
on the right or to the quantity on the left. After weeding out the repetitions
and non-extreme points, one obtains the following as the collection of all
extreme points of $M_{PQD}(0.6, 0.4; 0.2, 0.4, 0.2, 0.2)$.


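\[
P_1 = \begin{pmatrix} 0.12 & 0.24 & 0.12 & 0.12 \\ 0.08 & 0.16 & 0.08 & 0.08 \end{pmatrix}, \quad
P_2 = \begin{pmatrix} 0.12 & 0.24 & 0.20 & 0.04 \\ 0.08 & 0.16 & 0.00 & 0.16 \end{pmatrix},
\]
\[
P_3 = \begin{pmatrix} 0.12 & 0.28 & 0.20 & 0.00 \\ 0.08 & 0.12 & 0.00 & 0.20 \end{pmatrix}, \quad
P_4 = \begin{pmatrix} 0.20 & 0.16 & 0.12 & 0.12 \\ 0.00 & 0.24 & 0.08 & 0.08 \end{pmatrix},
\]
\[
P_5 = \begin{pmatrix} 0.20 & 0.16 & 0.20 & 0.04 \\ 0.00 & 0.24 & 0.00 & 0.16 \end{pmatrix}, \quad
P_6 = \begin{pmatrix} 0.20 & 0.20 & 0.08 & 0.12 \\ 0.00 & 0.20 & 0.12 & 0.08 \end{pmatrix},
\]
\[
P_7 = \begin{pmatrix} 0.20 & 0.20 & 0.20 & 0.00 \\ 0.00 & 0.20 & 0.00 & 0.20 \end{pmatrix}, \quad
P_8 = \begin{pmatrix} 0.12 & 0.36 & 0.00 & 0.12 \\ 0.08 & 0.04 & 0.20 & 0.08 \end{pmatrix},
\]
\[
P_9 = \begin{pmatrix} 0.12 & 0.36 & 0.12 & 0.00 \\ 0.08 & 0.04 & 0.08 & 0.20 \end{pmatrix}, \quad
P_{10} = \begin{pmatrix} 0.20 & 0.28 & 0.00 & 0.12 \\ 0.00 & 0.12 & 0.20 & 0.08 \end{pmatrix},
\]
\[
P_{11} = \begin{pmatrix} 0.20 & 0.28 & 0.12 & 0.00 \\ 0.00 & 0.12 & 0.08 & 0.20 \end{pmatrix}, \quad
P_{12} = \begin{pmatrix} 0.12 & 0.40 & 0.00 & 0.08 \\ 0.08 & 0.00 & 0.20 & 0.12 \end{pmatrix}.
\]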
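As a sanity check on the listing above, one can confirm that each of the twelve
matrices has the prescribed marginals and satisfies (1.1). The sketch below
tests membership in $M_{PQD}(0.6, 0.4; 0.2, 0.4, 0.2, 0.2)$ only; it makes no
attempt to verify extremality. The first rows are entered as read from the
listing, in units of 0.01 so that the arithmetic is exact, and the name
`in_m_pqd` is an illustrative choice.

```python
# First rows of the twelve listed matrices, in units of 0.01 (exact integers).
FIRST_ROWS = [
    (12, 24, 12, 12), (12, 24, 20, 4), (12, 28, 20, 0), (20, 16, 12, 12),
    (20, 16, 20, 4), (20, 20, 8, 12), (20, 20, 20, 0), (12, 36, 0, 12),
    (12, 36, 12, 0), (20, 28, 0, 12), (20, 28, 12, 0), (12, 40, 0, 8),
]
P1_MARGIN = 60                 # p_1 = 0.6
Q = (20, 40, 20, 20)           # q = (0.2, 0.4, 0.2, 0.2)

def in_m_pqd(row1):
    """Marginal and PQD checks for a first row; the second row is q - row1."""
    row2 = [qj - a for qj, a in zip(Q, row1)]
    if sum(row1) != P1_MARGIN or min(row1) < 0 or min(row2) < 0:
        return False
    s = c = 0
    for a, qj in zip(row1[:-1], Q[:-1]):
        s, c = s + a, c + qj
        if 100 * s < P1_MARGIN * c:   # inequality (1.1), scaled by 100^2
            return False
    return True

print(all(in_m_pqd(r) for r in FIRST_ROWS))
```

A first row such as (10, 30, 10, 10) would be rejected by this check, because
its first entry falls below $p_1 q_1 = 0.12$.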



Remarks. For the general case of n, identification of extreme points
can be carried out in a similar vein as outlined above. One can show that
the joint distribution under which X and Y are independent is always
an extreme point of $M_{PQD}(p_1,p_2;q_1,q_2,\ldots,q_n)$.


3. SOME APPLICATIONS

(1) On the affine property of certain measures of dependence

In the literature, there are several measures of dependence
proposed to study the degree of dependence between two random variables.
For details, one may refer to Agresti (1984, Chapter 9) or Goodman and
Kruskal (1979). In this section, we examine the following problem. For
any bivariate distribution P, let $\Delta(P)$ denote a proposed measure of
dependence. If $P_1$ and $P_2$ are two bivariate distributions in
$M_{PQD}(p_1,p_2;q_1,q_2,\ldots,q_n)$ and $0 \le \lambda \le 1$, is it true that

\[ \Delta(\lambda P_1 + (1-\lambda) P_2) = \lambda \Delta(P_1) + (1-\lambda) \Delta(P_2)? \]

If it is so, $\Delta(\cdot)$ can rightly be called an affine measure of dependence
on $M_{PQD}(p_1,p_2;q_1,q_2,\ldots,q_n)$.

Let $P = (p_{ij})$ be any bivariate distribution of order $2 \times n$. Let

\[ \Pi_c = 2\,[\,p_{11}(p_{22} + p_{23} + \cdots + p_{2n}) + p_{12}(p_{23} + p_{24} + \cdots + p_{2n})
          + \cdots + p_{1,n-1}\,p_{2n}\,] \]

and

\[ \Pi_d = 2\,[\,p_{1n}(p_{21} + p_{22} + \cdots + p_{2,n-1}) + p_{1,n-1}(p_{21} + p_{22} + \cdots + p_{2,n-2})
          + \cdots + p_{12}\,p_{21}\,]. \]

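For any P in $M_{PQD}(p_1,p_2;q_1,q_2,\ldots,q_n)$, the following are some of the
measures of dependence commonly used in the literature.

Kendall's Tau-b:
\[ \tau_b(P) = (\Pi_c - \Pi_d)\,/\,[(1 - p_1^2 - p_2^2)(1 - q_1^2 - q_2^2 - \cdots - q_n^2)]^{1/2} \]

Somers' d (Y on X):
\[ d_{YX}(P) = (\Pi_c - \Pi_d)\,/\,(1 - p_1^2 - p_2^2) \]

Somers' d (X on Y):
\[ d_{XY}(P) = (\Pi_c - \Pi_d)\,/\,(1 - q_1^2 - q_2^2 - \cdots - q_n^2) \]

Pearson's Correlation Coefficient:
\[ \rho(P) = \mathrm{Cov}_P(X,Y)\,/\,[\mathrm{Var}_P(X)\,\mathrm{Var}_P(Y)]^{1/2} \]

Spearman's rho:
\[ \rho_s(P) = \Big[\sum_{i=1}^{2}\sum_{j=1}^{n} (r_i^X - 0.5)(r_j^Y - 0.5)\,p_{ij}\Big]
   \Big/ \Big\{\Big[\sum_{i=1}^{2} (r_i^X - 0.5)^2 p_i\Big]\Big[\sum_{j=1}^{n} (r_j^Y - 0.5)^2 q_j\Big]\Big\}^{1/2}, \]

where $r_i^X = \sum_{k<i} p_k + p_i/2$, $i = 1,2$,
and $r_j^Y = \sum_{k<j} q_k + q_j/2$, $j = 1,2,\ldots,n$.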

Theorem 4  All the measures of dependence described above are affine on
the set $M_{PQD}(p_1,p_2;q_1,q_2,\ldots,q_n)$.

Proof. That the measures $\rho(\cdot)$ and $\rho_s(\cdot)$ are affine is obvious since
these measures are linear functions of the joint probabilities and the
marginals are fixed. For the affine property of $\tau_b(\cdot)$, $d_{YX}(\cdot)$ and
$d_{XY}(\cdot)$, it suffices to show that $\Pi_c - \Pi_d$ is a linear function of the
joint probabilities. By substituting $p_{2j} = q_j - p_{1j}$, $j = 1,2,\ldots,n$, one
can easily verify that

\[ \Pi_c - \Pi_d = 2 \sum_{j=1}^{n-1} p_{1j}
   - 2 \sum_{j=1}^{n-1} (q_1 + q_2 + \cdots + q_j)(p_{1j} + p_{1,j+1}). \]

This completes the proof.
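The identity used in the proof can be verified on concrete matrices. The
following sketch compares the defining expression for $\Pi_c - \Pi_d$ with the
linear form, and checks the affine property on a mixture of two of the 2 x 3
extreme points from Example 2. The function names and the mixing weight 1/3 are
illustrative assumptions; exact fractions are used so that equality can be
tested directly.

```python
from fractions import Fraction as F
from itertools import accumulate

def pi_c_minus_pi_d(P):
    """Defining expression: 2 * sum over j < k of (p_1j p_2k - p_1k p_2j)."""
    r1, r2 = P
    n = len(r1)
    return 2 * sum(r1[j] * r2[k] - r1[k] * r2[j]
                   for j in range(n) for k in range(j + 1, n))

def linear_form(row1, q):
    """Right-hand side of the identity in the proof of Theorem 4."""
    n = len(q)
    Q = list(accumulate(q))           # q_1 + ... + q_j
    return (2 * sum(row1[: n - 1])
            - 2 * sum(Q[j] * (row1[j] + row1[j + 1]) for j in range(n - 1)))

q = [F(50, 100), F(30, 100), F(20, 100)]
A = [[F(15, 100), F(15, 100), F(0)], [F(35, 100), F(15, 100), F(20, 100)]]
B = [[F(24, 100), F(0), F(6, 100)], [F(26, 100), F(30, 100), F(14, 100)]]
lam = F(1, 3)
M = [[lam * a + (1 - lam) * b for a, b in zip(ra, rb)] for ra, rb in zip(A, B)]

print(pi_c_minus_pi_d(A) == linear_form(A[0], q))
print(pi_c_minus_pi_d(M) == lam * pi_c_minus_pi_d(A) + (1 - lam) * pi_c_minus_pi_d(B))
```

Both comparisons should hold exactly; the second is the affine property of
$\Pi_c - \Pi_d$ restricted to distributions with common marginals.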


Remarks. In view of the above theorem, it suffices to compute the measure of
dependence for the extreme point distributions of $M_{PQD}(p_1,p_2;q_1,q_2,\ldots,q_n)$
for any of the measures of dependence described above. The measure of
dependence for any distribution P in $M_{PQD}(p_1,p_2;q_1,q_2,\ldots,q_n)$ is a
convex combination of the corresponding measures of dependence for the
extreme point distributions in $M_{PQD}(p_1,p_2;q_1,q_2,\ldots,q_n)$. We will make use
of this property in computing the asymptotic power of certain tests of
independence.

(2) Computation of the power function of tests of independence

Suppose X and Y are two random variables such that X takes
only two values, say, 1 and 2, and Y takes n values, say, $1,2,\ldots,n$.
Assume that the marginal distributions of X and Y are known and given by
$p_1, p_2$ and $q_1, q_2, \ldots, q_n$, respectively. The joint distribution
$P = (p_{ij})$ of X and Y is unknown but known to be positive quadrant dependent.
Suppose we wish to test the validity of the hypothesis

    $H_0$ : X and Y are independent

against

    $H_1$ : X and Y are strictly positive quadrant dependent

based on N independent realizations $(X_1,Y_1), (X_2,Y_2), \ldots, (X_N,Y_N)$ of
(X,Y). The null hypothesis $H_0$ is simple and the alternative $H_1$ is
composite. Let $P_1 = (p_i q_j)$. Then $H_0$ and $H_1$ can be written equivalently as

    $H_0$ : $P = P_1$  against  $H_1$ : $P \in M_{PQD}(p_1,p_2;q_1,q_2,\ldots,q_n)$, $P \ne P_1$.

If T is any test proposed for $H_0$ against $H_1$, one could use the substance
of Theorem 1 to give a simple expression for the power function of T. Let

\[ \beta_T(P) = \Pr(T \text{ rejects } H_0 \mid \text{the joint distribution of X and Y is } P),
   \quad P \in M_{PQD}(p_1,p_2;q_1,q_2,\ldots,q_n), \]

be the power function of T. Obviously, $\beta_T(P_1)$ is the size of the test T.
We explain how $\beta_T(P)$ can be simplified when n = 2. Let $P_1 = (p_i q_j)$ and
$P_2$ be the extreme points of $M_{PQD}(p_1,p_2;q_1,q_2)$. Let
$P \in M_{PQD}(p_1,p_2;q_1,q_2)$. Then we can write $P = \lambda P_1 + (1-\lambda) P_2$
for some $0 \le \lambda \le 1$. If the joint distribution of X and Y is P, we
denote the joint distribution of the random sample
$(X_1,Y_1), (X_2,Y_2), \ldots, (X_N,Y_N)$ by $P^N$. ($P^N$ does not mean that the
matrix P is multiplied by itself N times. $P^N$ is the product probability
measure.) Then

\[ P^N = [\lambda P_1 + (1-\lambda) P_2]^N
       = \sum_{r=0}^{N} \binom{N}{r} \lambda^r (1-\lambda)^{N-r} (P_1^r \otimes P_2^{N-r}), \]

where $P_1^r \otimes P_2^{N-r}$ is the joint distribution of
$(X_1,Y_1), (X_2,Y_2), \ldots, (X_N,Y_N)$ with
$(X_1,Y_1), (X_2,Y_2), \ldots, (X_N,Y_N)$ independently distributed,
$(X_1,Y_1), (X_2,Y_2), \ldots, (X_r,Y_r)$ identically distributed with common
distribution $P_1$, and $(X_{r+1},Y_{r+1}), (X_{r+2},Y_{r+2}), \ldots, (X_N,Y_N)$
identically distributed with common distribution $P_2$. We can extend the
definition of $\beta_T(P)$ to encompass this situation. Let


\[ \beta_T(P_1^r \otimes P_2^{N-r}) = \Pr(T \text{ rejects } H_0 \mid
   (X_1,Y_1), \ldots, (X_r,Y_r) \text{ have the common distribution } P_1 \text{ and }
   (X_{r+1},Y_{r+1}), \ldots, (X_N,Y_N) \text{ have the common distribution } P_2), \]

for $r = 0,1,2,\ldots,N$.

In this new notation, $\beta_T(P) = \beta_T(P^N)$.

The following result is obvious.

Theorem 5  For any P in $M_{PQD}(p_1,p_2;q_1,q_2)$,

\[ \beta_T(P) = \sum_{r=0}^{N} \binom{N}{r} \lambda^r (1-\lambda)^{N-r}\,
   \beta_T(P_1^r \otimes P_2^{N-r}), \]

where $P = \lambda P_1 + (1-\lambda) P_2$ with $0 \le \lambda \le 1$.


Remarks. In view of the above result, it suffices to compute
$\beta_T(P_1^r \otimes P_2^{N-r})$ for $r = 0,1,2,\ldots,N$. $\beta_T(P)$ is a convex
combination of these numbers for any P in $M_{PQD}(p_1,p_2;q_1,q_2)$. This formula
is also useful if one wishes to compare the power functions of any two tests.
This theorem has been exploited by Bhaskara Rao, Krishnaiah and Subramanyam
(1987) to compare the performance of some tests in the case of 2x3 bivariate
distributions in small samples. For small sample sizes N, computation
of power functions of tests and comparison of power functions of tests
can be carried out effectively using Theorem 5.
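Once the N + 1 numbers $\beta_T(P_1^r \otimes P_2^{N-r})$ are available, Theorem 5
turns the power at any P into a binomial-weighted average. A minimal sketch
follows; the function name `power_from_mixture` is hypothetical, and the `beta`
values in the example are placeholders invented for illustration, not results
from the paper.

```python
from math import comb

def power_from_mixture(lam, beta):
    """beta[r] stands for beta_T(P1^r (x) P2^(N-r)), r = 0..N; returns beta_T(P)
    for P = lam*P1 + (1 - lam)*P2 according to Theorem 5."""
    N = len(beta) - 1
    return sum(comb(N, r) * lam**r * (1 - lam)**(N - r) * beta[r]
               for r in range(N + 1))

# Placeholder rejection probabilities for N = 5 (illustrative only).
beta = [0.90, 0.70, 0.50, 0.30, 0.10, 0.05]
for lam in (0.0, 0.25, 0.5, 1.0):
    print(lam, power_from_mixture(lam, beta))
```

At $\lambda = 1$ the formula returns the size of the test, and at $\lambda = 0$
it returns the power at the extreme point $P_2$. Comparing two tests then
amounts to comparing two such weighted sums built from the same weights.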


We now wish to emphasize that the affine property of certain
measures of dependence also helps in evaluating the power function
asymptotically for some tests built on these measures of dependence.
As an illustration, consider the test based on Kendall's Tau-b. Let

\[ O_{ij} = \#\{(X_r,Y_r)\text{'s};\ X_r = i \text{ and } Y_r = j\},
   \quad\text{for } i = 1,2 \text{ and } j = 1,2. \]

The data $(X_1,Y_1), (X_2,Y_2), \ldots, (X_N,Y_N)$ can be summarized in the
form of a contingency table as follows.

             Y = 1      Y = 2
    X = 1    O_{11}     O_{12}
    X = 2    O_{21}     O_{22}

An estimator of Kendall's Tau-b is given by

\[ \hat\tau_{b,N} = (2/N^2)(O_{11} O_{22} - O_{12} O_{21})\,/\,
   [(1 - p_1^2 - p_2^2)(1 - q_1^2 - q_2^2)]^{1/2}. \]

Test based on Kendall's Tau-b

    T : Reject $H_0$ if and only if $\hat\tau_{b,N} > c$,

where c is the critical value of the test T,
which depends on the given level of significance.

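We now evaluate the power function of the test T asymptotically.
Assume, without loss of generality, that $p_1 \le q_1$. The extreme points of
$M_{PQD}(p_1,p_2;q_1,q_2)$ are

\[
P_1 = \begin{pmatrix} p_1 q_1 & p_1 q_2 \\ p_2 q_1 & p_2 q_2 \end{pmatrix}
\quad\text{and}\quad
P_2 = \begin{pmatrix} p_1 & 0 \\ q_1 - p_1 & q_2 \end{pmatrix}.
\]

Note that $\tau_b(P_1) = 0$ and

\[ \tau_b(P_2) = 2 p_1 q_2\,/\,[(1 - p_1^2 - p_2^2)(1 - q_1^2 - q_2^2)]^{1/2} = \delta, \text{ say}. \]

Then for any P in $M_{PQD}(p_1,p_2;q_1,q_2)$ with
$P = \lambda_P P_1 + (1-\lambda_P) P_2$ and $0 \le \lambda_P \le 1$, it follows that
$\tau_b(P) = (1-\lambda_P)\,\delta$.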
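The estimator and the constant $\delta$ can be evaluated directly. The sketch
below assumes known marginals with $p_1 \le q_1$ and uses an idealized table of
counts $O_{ij} = N p_{ij}$ to show that the estimator reproduces
$(1-\lambda_P)\,\delta$; the function names, the marginals 0.3 and 0.5, and the
counts are illustrative choices rather than data from the paper.

```python
from math import isclose, sqrt

def norm_const(p, q):
    """[(1 - p1^2 - p2^2)(1 - q1^2 - q2^2)]^(1/2) for the 2 x 2 case."""
    return sqrt((1 - p[0]**2 - p[1]**2) * (1 - q[0]**2 - q[1]**2))

def tau_b_hat(O, p, q):
    """Estimator of Kendall's tau-b from a 2 x 2 table of counts O."""
    N = sum(map(sum, O))
    return (2 / N**2) * (O[0][0] * O[1][1] - O[0][1] * O[1][0]) / norm_const(p, q)

def delta(p, q):
    """tau_b at the extreme point P2, assuming p1 <= q1."""
    return 2 * p[0] * q[1] / norm_const(p, q)

p, q = (0.3, 0.7), (0.5, 0.5)
# P = 0.5*P1 + 0.5*P2 = [[0.225, 0.075], [0.275, 0.425]]; counts for N = 1000.
O = [[225, 75], [275, 425]]
print(tau_b_hat(O, p, q), 0.5 * delta(p, q))
```

The two printed numbers agree up to floating-point rounding, which is the
finite-sample counterpart of the limit used in the asymptotic argument.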


The asymptotic power function of T has the following structure.

Theorem 6  Let $P \in M_{PQD}(p_1,p_2;q_1,q_2)$.

(i) If $c < 0$, then $\lim_{N \to \infty} \beta_T(P) = 1$.

(ii) If $c > 0$, then

\[ \lim_{N \to \infty} \beta_T(P) =
   \begin{cases} 1 & \text{if } (1-\lambda_P)\,\delta > c, \\
                 0 & \text{if } (1-\lambda_P)\,\delta < c. \end{cases} \]

Proof. Observe that by the Strong Law of Large Numbers,
$\hat\tau_{b,N}$ converges to $\tau_b(P) = (1-\lambda_P)\,\delta$
almost surely as $N \to \infty$ under the joint distribution P of X and Y.
Since $(1-\lambda_P)\,\delta \ge 0$, the limit exceeds every negative c.
From this, the desired conclusion follows.

Remarks. It is heartening to note that the size of the test T converges
to 0 as $N \to \infty$ for $c > 0$. For $P \ne P_1$, one would like to have the
power $\beta_T(P)$ converge to 1 as $N \to \infty$. As is to be expected, this
is not the case. The hypotheses $H_0$ and $H_1$ are not well separated. If
the joint distribution P is very close to $P_1$, the power $\beta_T(P)$
converges to 0 as $N \to \infty$. However, if there is reason to believe that
the bivariate distributions under the alternative hypothesis are close
to $P_2$, then the test T has the desired properties mentioned above.
More specifically, let $0 < d < 1$ be fixed. Let the hypotheses be

    $H_0$ : $P = P_1$

and

    $H_1$ : $P = \lambda P_1 + (1-\lambda) P_2$ with $0 \le \lambda \le d$.

The hypotheses $H_0$ and $H_1$ are well separated. We can choose a suitable
c which figures in the description of the test T so that we will have

\[ \lim_{N \to \infty} \beta_T(P_1) = 0 \]

and

\[ \lim_{N \to \infty} \beta_T(P) = 1 \]

for every P under $H_1$.

Remarks. Computation of the asymptotic power of tests based on affine measures
of dependence can be carried out in the same vein as above for any general n.



REFERENCES

AGRESTI, A. (1984) Analysis of Ordinal Categorical Data. Wiley, Chichester, U.K.

BARLOW, R.E. and PROSCHAN, F. (1981) Statistical Theory of Reliability and
Life Testing. To Begin With, Silver Spring, Maryland, U.S.A.

BHASKARA RAO, M., KRISHNAIAH, P.R. and SUBRAMANYAM, K. (1987) A Structure
Theorem on Bivariate Positive Quadrant Dependent Distributions
and Tests for Independence in Two-Way Contingency Tables. To
appear in Journal of Multivariate Analysis.

EATON, M.L. (1982) A Review of Selected Topics in Multivariate Probability
Inequalities, Ann. Stat., 10, 11-43.

GOODMAN, L.A. and KRUSKAL, W.H. (1979) Measures of Association for Cross
Classifications. Springer-Verlag, Berlin.

LEHMANN, E.L. (1966) Some Concepts of Dependence, Ann. Math. Stat., 37, 1137-1153.

NGUYEN, T.T. and SAMPSON, A.R. (1985) The Geometry of Certain Fixed Marginal
Probability Distributions, Linear Algebra and Its Applications, 70, 73-87.


REPORT DOCUMENTATION PAGE

Report number: 86-13
Title: On the extreme points of the set of all 2xn bivariate positive quadrant
dependent distributions with fixed marginals and some applications
Type of report: Technical, June 1987
Authors: K. Subramanyam and M. Bhaskara Rao
Performing organization: Center for Multivariate Analysis, University of
Pittsburgh, 515 Thackeray Hall, Pittsburgh, PA 15260
Controlling office: Air Force Office of Scientific Research, Department of the
Air Force, Bolling Air Force Base, DC 20332
Report date: June 1987
Security classification: Unclassified
Distribution statement: Approved for public release; distribution unlimited


